Input file flh15625cons; Output File 15625tr 

Sequence length 2286 



CXTIXXXTTAGCTTTGAGTCCAGTGTTTGAA^ 
AGAGCACTCAAGACTTTAC^ 



AAAGGAAAATACCAGATGCXIACTCTGCAGGCTGC^^ 

MQAVDNLTSAPGNT 14 

AGTTATCAGGTAACCAACAAGAA ATG CAA GCC GTC GAC AAC CTC ACC TCT GCG CCT GGG AAC ACC 42 

SLCTRDYKITQVLFPLLYTV 34 

AGT CTG TGC ACC AGA GAC TAC AAA ATC ACC CAG GTC CTC TTC CCA CTG CTC TAC ACT GTC 102 

LFFVGLITNGLAMRIFFQIR 54 

CTG TTT TTT GTT GGA CTT ATC ACA AAT GGC CTG GCG ATG AGG ATT TTC TTT CAA ATC CGG 162 

SKSNFIIFLKNTVISDLLMI 74 

AGT AAA TCA AAC TTT ATT ATT TTT CTT AAG AAC ACA GTC ATT TCT GAT CTT CTC ATG ATT 222 

LTFPFKILSDAKLGTGPLRT 94 

CTG ACT TTT CCA TTC AAA ATT CTT AGT GAT GCC AAA CTG GGA ACA GGA CCA CTG AGA ACT 282 

FVCQVTSVIFYFTMYISISF 114 

TTT GTG TGT CAA GTT ACC TCC GTC ATA TTT TAT TTC ACA ATG TAT ATC AGT ATT TCA TTC 342 

LGLITIDRYQKTTRPFKTSN 134 

CTG GGA CTG ATA ACT ATC GAT CGC TAC CAG AAG ACC ACC AGG CCA TTT AAA ACA TCC AAC 402 

PKNLLGAKILSVVIWAFMFL 154 

CCC AAA AAT CTC TTG GGG GCT AAG ATT CTC TCT GTT GTC ATC TGG GCA TTC ATG TTC TTA 462 

LSLPNMILTNRQPRDKNVKK 174 

CTC TCT TTG CCT AAC ATG ATT CTG ACC AAC AGG CAG CCG AGA GAC AAG AAT GTG AAG AAA 522 

CSFLKSEFGLVWHEIVNYIC 194 

TGC TCT TTC CTT AAA TCA GAG TTC GGT CTA GTC TGG CAT GAA ATA GTA AAT TAC ATC TGT 582 

QVIFWINFLIVIVCYTLITK 214 

CAA GTC ATT TTC TGG ATT AAT TTC TTA ATT GTT ATT GTA TGT TAT ACA CTC ATT ACA AAA 642 

ELYRSYVRTRGVGKVPRKKV 234 

GAA CTG TAC CGG TCA TAC GTA AGA ACG AGG GGT GTA GGT AAA GTC CCC AGG AAA AAG GTG 702 

NVKVFIIIAVFFICFVPFHF 254 

AAC GTC AAA GTT TTC ATT ATC ATT GCT GTA TTC TTT ATT TGT TTT GTT CCT TTC CAT TTT 762 

ARIPYTLSQTRDVFDCTAEN 274 

GCC CGA ATT CCT TAC ACC CTG AGC CAA ACC CGG GAT GTC TTT GAC TGC ACT GCT GAA AAT 822 

TLFYVKESTLWLTSLNACLD 294 

ACT CTG TTC TAT GTG AAA GAG AGT ACT CTG TGG TTA ACT TCC TTA AAT GCA TGC CTG GAT 882 

PFIYFFLCKSFRNSLISMLK 314 

CCG TTC ATC TAT TTT TTC CTT TGC AAG TCC TTC AGA AAT TCC TTG ATA AGT ATG CTG AAG 942 

CPNSATSLSQDNRKKEQDGG 334 

TGC CCC AAT TCT GCA ACA TOP CTG TCC CAG GAC AAT AGG AAA AAA GAA CAG GAT GGT GGT 1002 

DPNEETPM * 
GAC CCA AAT GAA GAG ACT CCA ATG TAA 

ACAAATTAACTAACX3AAATATTTCAATCTCTTT 

GAOGAAGAAGCAACTAAGTTAATAATAATGAC^ 

CTATGAAAAGCTATCTTAAAATATAGAAAACTAATCTAAACTCT 

GTCATGCTGCATGCAAAACTACACAGAATTC^ 

TAATTTTTAAAATACATTATOGTTCACAATTTTATT^ 

TCTTACCAAAAATGATAGTTAAAATGTATATATATCCT 

AATTTAAGTAAGTQGGATACACAAAGAATAATAACrATO 

ATTGAAACTGTATTTGATTtX^COT 

GAGAAGAAATATOSAAGTCATTAAAATAAGGAGACr^ 

CTTAATTCTAGAGAAACTAGTTTTACT^ 

GCTTCTAGAAAATAGCTGCTAATTAGGTra 

TTGCACAGC^TAACTACTGAGAGGA 

AATTTACATTAAACTCTAAAAAAAAAAAAAAAAAAAAAAAAAAGGGOGG 
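
The amino acid lines in the listing above can be checked against their codon lines. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names BASES, AMINO, and translate are illustrative and do not come from the original sheets. Codons that contain unreadable characters are reported as X rather than guessed.

```python
# Standard genetic code (NCBI translation table 1), indexed in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(codon_line):
    """Translate space-separated codons; unreadable codons become 'X'."""
    protein = []
    for codon in codon_line.split():
        codon = codon.upper()
        if len(codon) == 3 and all(base in BASES for base in codon):
            index = (16 * BASES.index(codon[0])
                     + 4 * BASES.index(codon[1])
                     + BASES.index(codon[2]))
            protein.append(AMINO[index])
        else:
            protein.append("X")  # e.g. extraction damage in a codon
    return "".join(protein)


# Codon row ending at nucleotide 222 of the listing above.
row = ("AGT AAA TCA AAC TTT ATT ATT TTT CTT AAG "
       "AAC ACA GTC ATT TCT GAT CTT CTC ATG ATT")
print(translate(row))  # SKSNFIIFLKNTVISDLLMI
```

Comparing the printed string with the amino acid line above each codon row is a quick way to spot codons that were misread during extraction.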
>PF00001|PF00001 7 transmembrane receptor (rhodopsin family) 

Score: 184.21 Seq: 42 298 Model: 1 269 

*GNiLVTWvIcRyRRMRTPMNYFIvNLAvADLLFslftMPFWMvYyvMqg 
 N+L + ++++ R+ ++ + +F+ N ++DLL+ ++T+PF +++ + G 
Flh15625or 42 TNGLAMRIFFQIRS-KSNFIIFLKNTVISDLLM-ILTFPFKILSDAKLG 

RWpFGdfMCrIWmYFDYMNMYASIFfLTcISIDRYLWAICHPMrYmRWMT 
 + P+ +F+C +++ ++Y++MY SI FL +I+IDRY+ ++P++ + + 
Flh15625or 89 TGPLRTFVCQVTSVIFYFTMYISISFLGLITIDRYQ-KTTRPFKTSNPKN 

pRHRAWvMIiilWvMSFlISMPPFLMFrWstyrDEneWNmTWCmlyDWPe 
 + A++++++IW+++FL+S+P + M+ + T+R ++ N+ C++ E 
Flh15625or 138 LL-GAKILSVVIWAFMFLLSLP-N-MI-L-TNRQPRDKNVKKCSF-LKSE 

. .v^rWYvII^tiimgFYIPMilMlFCYwRIYRIaRlWMRMIpswQrRR 
W +V ++ + F I ++I ++CY++I +++++ ++ +++ + 
Flh15625or 182 FGLVWHEIVNYICQ-VIFWINFLIVIVCYTLITKELYRSYVRTRGVGK-- 

rmSmRfERRivKMliIIMWFIICWlPYFIvmfMDTLM.MwwFCefC.Iw 
 ++++ ++II+ VF+IC+ P++ + + +TL ++ ++ + 
Flh15625or 229 VPRKKVNVKVFIIIAVFFICFVPFHFARIPYTLSQTRDVFDCTAEN 

rrlWmY.IfeWLaYvNCpCiNPIIY* 
 ++++ ++WL ++N C++P+IY 
Flh15625or 275 TLFYVKESTLWLTSLNA-CLDPFIY 298 



FIG 2 
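
The residue coordinates printed beside each alignment row can be cross-checked by counting the non-gap characters in that row. The following is a minimal sketch; the function name last_residue is hypothetical, and the row strings are the first and last query rows of the alignment with extraction spacing removed.

```python
def last_residue(start, aligned_row):
    """Coordinate of the final residue in a gapped row beginning at `start`."""
    residues = sum(1 for ch in aligned_row if ch.isalpha())  # skip '-' gaps
    return start + residues - 1


first_row = "TNGLAMRIFFQIRS-KSNFIIFLKNTVISDLLM-ILTFPFKILSDAKLG"
final_row = "TLFYVKESTLWLTSLNA-CLDPFIY"

print(last_residue(42, first_row))   # 88, so the next row starts at 89
print(last_residue(275, final_row))  # 298, the end of the "Seq: 42 298" range
```

A row whose computed end does not precede the next row's start by exactly one has probably lost or gained characters during extraction.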




Title: 15625 Receptor, A Novel G-Protein Coupled Receptor 
Inventor(s): Glucksmann et al. 
Application No: Not Assigned 
Atty Dkt No: 35800/238853(5800-13B) 







FIG 4 





>PS00001|PDOC00001|ASN_GLYCOSYLATION N-glycosylation site. 
N[^P][ST][^P] 

Query: 6 nlts 9 
Query: 13 ntsl 16 

>PS00004|PDOC00004|CAMP_PHOSPHO_SITE cAMP- and cGMP-dependent protein kinase 
phosphorylation site. 
[RK]{2}[A-Z][ST] 

Query: 173 kkcs 176 

>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site. 
[ST][A-Z][RK] 

Query: 126 ttr 128 
Query: 163 tnr 165 
Query: 304 sfr 306 

>PS00008|PDOC00008|MYRISTYL N-myristoylation site. 
G[^EDRKHPFYW][A-Z]{2}[STAGCN][^P] 

Query: 39 glitng 44 
Query: 333 ggdpne 338 
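
The motif patterns above are written in regular-expression form, so the reported hits can be reproduced with a short scan. The following is a minimal sketch using Python's re module; the names PATTERNS and scan are illustrative, and n_term is only the first 34 residues of the translated 15625 sequence, not the full protein. A lookahead is used so that overlapping matches are not skipped.

```python
import re

# Patterns as listed in the motif output, keyed by entry name.
PATTERNS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",
    "CAMP_PHOSPHO_SITE": r"[RK]{2}[A-Z][ST]",
    "PKC_PHOSPHO_SITE": r"[ST][A-Z][RK]",
    "MYRISTYL": r"G[^EDRKHPFYW][A-Z]{2}[STAGCN][^P]",
}


def scan(protein, pattern):
    """Return (start, matched text, end) tuples using 1-based coordinates."""
    hits = []
    # The lookahead lets matches overlap; group 1 holds the matched residues.
    for match in re.finditer("(?=(" + pattern + "))", protein):
        start = match.start() + 1
        text = match.group(1)
        hits.append((start, text.lower(), start + len(text) - 1))
    return hits


n_term = "MQAVDNLTSAPGNTSLCTRDYKITQVLFPLLYTV"  # residues 1-34 only
for start, text, end in scan(n_term, PATTERNS["ASN_GLYCOSYLATION"]):
    print("Query:", start, text, end)
```

On this fragment the scan reports the two N-glycosylation hits at residues 6 to 9 and 13 to 16, matching the first entry above.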
Input file 79a2cons; Output File 79a2tra 
Sequence length 2272 

ACGCGTCCGCAATCTCTGATTGTAAAGCCCTCTCTTCCTCTCCTTCTATTTCTCTATAGAACACTCAAGACTTTACTGA 

TGAAAACTCAGGAAATTCTCTATCACAAAGAGGTTTGGCAACTAAACTAAGACATTAAAAGGAAAATACCAGAT 

TCTGCACGTTGCAATAACTACTATTTACTGGATACATTCAAATCCT 

MQAIDNLTSAPGNTSLCTR 19 
A ATG CAA GCC ATC GAC AAC CTC ACG TCT GCG CCT GGG AAC ACC AGT CTG TGC ACC AGA 57 



D   Y   K   I   T   Q   V   L   F   P   L   L   Y   T   V   L   F   F   V   G   39 
GAC TAC AAA ATC ACC CAG GTC CTC TTC CCA CTG CTC TAC ACT GTC CTG TTT TTT GTT GGA 117 

L   I   T   N   S   L   A   M   R   I   F   F   Q   I   R   S   K   S   N   F   59 
CTC ATC ACA AAT AGC CTG GCG ATG AGG ATT TTC TTT CAA ATT CGG AGT AAA TCA AAC TTT 177 

I   I   F   L   K   N   T   V   I   S   D   L   L   M   I   L   T   F   P   F   79 
ATT ATT TTT CTT AAG AAC ACA GTC ATT TCC GAT CTT CTC ATG ATT CTG ACT TTT CCA TTC 237 

K   I   L   S   D   A   K   L   G   T   G   P   L   R   T   F   V   C   Q   V   99 
AAA ATT CTT AGT GAT GCC AAA CTG GGA ACA GGA CCA CTG AGA ACT TTT GTG TGT CAA GTT 297 


T   S   V   I   F   Y   F   T   M   Y   I   S   I   S   L   L   I   T   119 
ACC TCC GTC ATA TTT TAT TTC ACA ATG TAT ATC AGT ATT     CTG CTG ATA ACT 357 

I   D   R   Y   Q   K   T   T   R   P   F   K   T   S   N   P   K   N   L   L   139 
ATC GAT CGC TAC CAG AAG ACC ACC AGG CCA TTT AAA ACA TCC AAC CCC AAA AAT CTC TTG 417 

G   A   K   I   L   S   V   L   I   W   M   L   L   P   N   159 
GGG GCT AAG ATT CTC TCT GTT CTC ATC TGG ATG CTC TTG CCT AAC 477 

M   I   L   T   N   R   R   P   R   D   K   N   V   K   K   C   S   F   L   K   179 
ATG ATT CTG ACT AAC AGG CGG CCA AGA GAC AAG AAT GTG AAG AAA TGC TCT TTC CTT AAA 537 

S   E   F   G   L   V   W   H   E   I   V   N   I   W   199 
TCA GAG TTC GGC CTA GTC TGG CAT GAA ATA GTA AAT     TGG 597 

I   N   F   L 
ATT AAT TTC TTA 657 

V   G   K   V   R   K   K   V   N   V   K   V   F 
GTA GGT AAA GTC AGG AAA AAG     AAC GTC AAA GTT TTC 717 

I   I   I   A   V   F   F   I   C   F   P   A 
ATT ATC ATT GCT GTA TTC TTT ATT TGT TTT CCT GCC 

K   E   S   T   L   W   L   T   S   L   N   A   C   L   D   P   F   T   Y   F   299 
AAA GAG AGT ACT CTG TGG TTA ACT TCC TTA AAT GCA TGC CTG GAT CCG TTC ACC TAT TTT 897 

F   L   C   K   S   F   R   N   S   L   I   S   M   L   K   C   P   N   S   A   319 
TTC CTT TGC AAG TCC TTC AGA AAT TCC TTG ATA AGT ATG CTG AAG TGC CCC AAT TCT GCA 957 

T   S   Q   S   Q   D   N   R   K   K   E   Q   D   G   G   D   P   N   E   E   339 
ACA TCT CAG TCC CAG GAC AAT AGG AAA AAA GAA CAG GAT GGT GGT GAC CCA AAT GAA GAG 1017 

T   P   M   *   343 
ACT CCA ATG TAA 1029 

ACATATTAACTGAGGAAATATGTCAATCTCTTTGCGTTCAGAACTCATTAAAGC 
GACGAAGAAGCAACTGAGTTAATAACAATGACTCTTAAAACAT^ 

TTTCCAGTATGAAAAGCTATGTTAAAATATAGAAAACTAATCTAACCTGTAGCTGTATAGTATCAAAACAAATG 
CAATTGGCATGCTGCATGCAAAACTACACAGAATTC^CGTTTTGCAGAGT^ 

TACCGTAATGTTTAAAATA(^TTATTGCTCACGATTTTATTTCTTCATAATCAACTAAGGAAGAATTATCAATTGGATA 
CAATCTTCTTACAAAAAATGACACTTAAAATGTATATATATCCTAGCCXCTAACC 

ATAAAAATTTGAGTAAGTGGGATACACAAAGAATAATAACTATTAACTTTTAATTATGAGCAAAAACCTAA 
TTTAAACTAATTGAAACTGTATTTGATTGGACTTAATTTTTTTGTTTATTA^ 

TAAAGAGAAGAAATATCAAAGTCATTAAAATAAGGAGAGTTACTTTTATGATATTCTAACACTAAACAATATAGAAATA 
TTTCCTTAATATTAGTTTCTAGAGAAACTAGTTTTACTAATTTTTTACAACCTCAATAATACCATCATTGACACTTACC 
TTTATTAACTAGCTTCTAGAAAATACCTGCTAATTAGGTTAATGAACATTTTATGTTAGTGAAAAAAATTAATTAAATA 
TGATTACAAAGTTGCACAGCATAACTACTGAAAGTGATTGATCCATTTGTAATTATTTGTTTGTACTGGTGTGTATAAA 
ATACAAAATTTACATTAAACTCTAAATCACCAAAAAAAAAAAAAAAAAAAAGGGCGG 
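
The two translated sequences in these sheets can be compared position by position. The following is a minimal sketch limited to the first 19 residues of each translation as printed above; the function name differences and both variable names are illustrative only, and the comparison assumes equal-length, ungapped input.

```python
def differences(seq_a, seq_b):
    """List (position, residue_a, residue_b) where two equal-length strings differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return [(i + 1, a, b)
            for i, (a, b) in enumerate(zip(seq_a, seq_b))
            if a != b]


n_term_15625 = "MQAVDNLTSAPGNTSLCTR"  # from the flh15625cons translation
n_term_79a2 = "MQAIDNLTSAPGNTSLCTR"   # from the 79a2cons translation

print(differences(n_term_15625, n_term_79a2))  # [(4, 'V', 'I')]
```

Extending the comparison to the full proteins would need an alignment step, since damaged rows in the second listing leave the two translations with different lengths.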